Mining Visual Actions from Movies
Abstract
This paper presents an approach for mining visual actions from real-world videos. Given a large number of movies, we want to automatically extract short video sequences corresponding to visual human actions. First, we find commonly occurring actions by mining verbs extracted from movie transcripts. Next, we align the transcripts with the videos using subtitles. We then retrieve video samples for each action of interest. Not all of these samples visually characterize the action. Therefore, we propose to rank the retrieved videos by visual consistency. We first explore two unsupervised outlier detection methods: one-class Support Vector Machines (SVM) and finding the densest component of a similarity graph. As an alternative, we show how to obtain and use weak supervision. We investigate a direct use of binary SVM and propose a novel iterative re-training scheme for Support Vector Regression machines (SVR). Experimental results explore actions in 144 episodes of the TV series Buffy the Vampire Slayer and show: (a) the applicability of our approach to a large set of real-world videos, (b) how to use visual consistency for ranking videos retrieved from text, (c) the added value of random nonaction samples, i.e., the importance of weak supervision and (d) the ability of our iterative SVR re-training algorithm to handle mistakes in the weak supervision. The quality of the rankings obtained is assessed on manually annotated data for six different action classes.
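As a rough illustration of ranking retrieved samples by visual consistency with a similarity graph, the sketch below scores each sample by how long it survives a greedy peeling of the graph. This is only a minimal sketch under stated assumptions: greedy peeling is a standard heuristic for dense-subgraph search and is not claimed to be the procedure used in the paper, and the Gaussian similarity, the `gamma` parameter, the function names, and the 2-D toy features are all hypothetical stand-ins for real video descriptors.

```python
import math


def rbf_similarity(a, b, gamma=1.0):
    """Gaussian similarity between two feature vectors (assumed similarity measure)."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-gamma * sq_dist)


def rank_by_greedy_peeling(features, gamma=1.0):
    """Return sample indices ordered from most to least visually consistent.

    Hypothetical helper: repeatedly removes the sample with the lowest
    weighted degree in the similarity graph. Samples removed early are
    treated as likely outliers; samples that survive longest form the
    densest part of the graph and are ranked first.
    """
    n = len(features)
    # Dense symmetric similarity matrix with a zero diagonal.
    sim = [
        [0.0 if i == j else rbf_similarity(features[i], features[j], gamma)
         for j in range(n)]
        for i in range(n)
    ]
    degree = [sum(row) for row in sim]
    alive = set(range(n))
    removal_order = []
    while alive:
        # Peel the weakest-connected remaining sample (ties broken by index).
        worst = min(alive, key=lambda i: (degree[i], i))
        alive.remove(worst)
        removal_order.append(worst)
        for j in alive:
            degree[j] -= sim[worst][j]
    # Last removed = most consistent, so reverse the removal order.
    return removal_order[::-1]


# Toy data: four mutually similar samples plus two unrelated ones.
toy_features = [
    (0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),  # consistent cluster
    (5.0, 5.0), (-4.0, 6.0),                          # outliers
]
ranking = rank_by_greedy_peeling(toy_features)
print(ranking)
```

In this toy setting the two isolated samples end up at the bottom of the ranking. With real data, the feature vectors would come from whatever video descriptor is in use, and the similarity would need to be chosen to match it.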
Related resources
Word of Mouth in Online Social Network, Using Twitter to Predict Box Office Revenue
In this paper, we study the economic impact of user-generated content (UGC) on Twitter on movies’ box office revenue. In contrast to other UGC such as online reviews, Twitter more truthfully documents a product’s word of mouth (WOM) among ordinary consumers. We develop a new text mining technique to extract and analyze the movie conversations among Twitter users. This mining technique is specially-de...
Time Based Context Cluster Analysis for Automatic Blog Generation
This paper describes the algorithms developed to identify the actions of a community of users. Actions are detected through the processing of raw context data acquired from the users’ smartphones or PDAs by our context awareness platform. Data mining techniques, and in particular cluster analysis, allow us to discover high level information, like the actions performed by the subjects during eac...
The visual analysis of emotional actions.
Is the visual analysis of human actions modulated by the emotional content of those actions? This question is motivated by a consideration of the neuroanatomical connections between visual and emotional areas. Specifically, the superior temporal sulcus (STS), known to play a critical role in the visual detection of action, is extensively interconnected with the amygdala, a center for emotion pr...
The Use of a Cultural Protocol for Quantifying Cultural Variations in Verb Semantic between Chinese and French
In this methodological investigation, we examined the influence of cultural background on viewers’ interpretations of visual stimuli and verbs elicited by these materials. French and Mandarin native speakers’ interpretations of seventeen short movies, produced by French speakers, depicting various state-changing actions were collected by a 25-item cultural protocol. A slight difference in the f...
Mining Web Graphs for Recommendations
As the variety of content on the web increases, so do user recommendations, and techniques for maintaining them become necessary. Recommendations on the web cover movies, news, music, books, images, etc. Recommendations are searched for all around the world. These recommendation data are modeled in data resources using different types of graphs. But these methods are less s...